D$^2$: Decentralized Training over Decentralized Data
نویسندگان
چکیده
While training a machine learning model using multiple workers, each of which collects data from their own data sources, it would be most useful when the data collected from different workers can be unique and different. Ironically, recent analysis of decentralized parallel stochastic gradient descent (D-PSGD) relies on the assumption that the data hosted on different workers are not too different. In this paper, we ask the question: Can we design a decentralized parallel stochastic gradient descent algorithm that is less sensitive to the data variance across workers? In this paper, we present D2, a novel decentralized parallel stochastic gradient descent algorithm designed for large data variance among workers (imprecisely, “decentralized” data). The core of D2 is a variance blackuction extension of the standard D-PSGD algorithm, which improves the convergence rate from O ( σ √ nT + (nζ 2) 1 3 T2/3 ) to O ( σ √ nT ) where ζ2 denotes the variance among data on different workers. As a result, D2 is robust to data variance among workers. We empirically evaluated D2 on image classification tasks where each worker has access to only the data of a limited set of labels, and find that D2 significantly outperforms D-PSGD.
منابع مشابه
Economic Droop Scheme for Decentralized Power Management in DC Microgrids
This paper proposes an autonomous and economic droop control scheme for DC microgrid application. In this method, a cost-effective power sharing technique among various types of DG units is properly adopted. The droop settings are determined based on an algorithm to individually manage the power management without any complicated optimization methods commonly applied in the centralized control ...
متن کاملA MULTI-OBJECTIVE DECENTRALIZED MULTIPLE CONSTRUCTION PROJECTS SCHEDULING PROBLEM CONSIDERING PERIODIC SERVICES AND ORDERING POLICIES
In decentralized construction projects, costs are mostly related to investment, material, holding, logistics, and other minor costs for implementation. For this reason, simultaneous planning of these items and appropriate scheduling of activities can significantly reduce the total costs of the project undertaken. This paper investigates the decentralized multiple construction projects schedulin...
متن کاملPerformance Evaluation of Supply Chain under Decentralized Organization Mechanism
Abstract Nowadays among many evaluation methods, data envelopment analysis has widely used to evaluate the relative performance of a set of Decision Making Units (DMUs). Data Envelopment Analysis (DEA(is a mathematical tool for evaluating the relative efficiency of a set Decision Making Units (DMUs), with multiple inputs and outputs. Traditional DEA models treat with each DMU as a “black box" t...
متن کاملThe Expected Achievable Distortion of Two-User Decentralized Interference Channels
This paper concerns the transmission of two independent Gaussian sources over a two-user decentralized interference channel, assuming that the transmitters are unaware of the instantaneous CSIs. The availability of the channel state information at receivers (CSIR) is considered in two scenarios of perfect and imperfect CSIR. In the imperfect CSIR case, we consider a more practical assumption of...
متن کاملDecentralized prognosis of fuzzy discrete-event systems
This paper gives a decentralized approach to the problem of failure prognosis in the framework of fuzzy discrete event systems (FDES). A notion of co-predictability is formalized for decentralized prognosis of FDESs, where several local agents with fuzzy observability rather than crisp observability are used in the prognosis task. An FDES is said to be co-predictable if each faulty event can be...
متن کامل